ED 137 782                                        CS 203 271

AUTHOR        Bay, Libby; McCulloch, Elizabeth
TITLE         Towards Uniformity in Grading Standards.
PUB DATE      76
NOTE          3p.; Paper presented at the Annual Meeting of the New
              York State English Council (26th, October 15-17,
              1976)

EDRS PRICE    MF-$0.83 HC-$1.67 Plus Postage.
DESCRIPTORS   Academic Standards; College Freshmen; Community
              Colleges; *Composition (Literary); *English
              Departments; *English Instruction; Essay Tests;
              *Expository Writing; *Grading; Junior Colleges



ABSTRACT 

To study grading standards and consistency within the
English department, 1600 freshmen at Rockland Community College were
asked to complete a uniform exit essay at the end of English 101.
After developing criteria for grading the papers, members of the
department marked their own papers and one other set. Eight months
later, 240 of the papers were regraded by the original instructor, in
order to assess self-consistency in marking. Comparison of final
grades, essay grades assigned by the instructor, and essay grades
assigned by the disinterested marker suggested that there was a
general consistency in grading throughout the department; the papers
that were regraded eight months later showed a similar consistency
for individual staff members. The exit essay experiment was felt to
have been worthwhile, in part because of the cooperative effort
involved in carrying it through. This led to an awareness of what the
department grading standards were, of the extent to which they were
being followed, and of the way in which individual grading policies
compared with those of colleagues. (AA)



* Documents acquired by ERIC include many informal unpublished        *
* materials not available from other sources. ERIC makes every effort *
* to obtain the best copy available. Nevertheless, items of marginal  *
* reproducibility are often encountered and this affects the quality  *
* of the microfiche and hardcopy reproductions ERIC makes available   *
* via the ERIC Document Reproduction Service (EDRS). EDRS is not      *
* responsible for the quality of the original document. Reproductions *
* supplied by EDRS are the best that can be made from the original.   *



Towards Uniformity in Grading Standards

Libby Bay and Elizabeth McCulloch

Like the weather, grading is a subject everybody talks about but nobody does
anything about. Two years ago the English Department at Rockland Community College
decided that, dire predictions to the contrary, we would seed our academic clouds
and see if we could work towards some predictability in grading.

Our concern began with the depressed state of student writing and the elevated
state of student grades. Sixty percent of our students earned recognition on the
Dean's List; more than fifty percent of our freshmen received A's and B's in their
English courses. Yet, somehow, these statistics did not jibe with our gut feelings
about student accomplishments, especially in English, nor with what our eyes saw
as we looked at student writing. Therefore, we made Freshman English, more
particularly the grading of student themes, our special agenda for the next two years.

Our first venture was a grading workshop. The entire department came together
to review five papers. When we were finished, and far apart on at least one, we
argued the criteria we had used.

Obviously, five essays provided us with limited data and questionable results.
Thus we decided to experiment with cooperative grading on a much larger scale by
giving a uniform exit essay at the end of English 101 in January 1976 to
approximately 1600 freshmen. This move was a bold one since ours is a department
where freedom has always been the hallmark. We have no standard texts, no
departmental tests, very little administrative supervision. We have always worked
from the assumption of professional integrity and responsibility and left major
decisions about class conduct to individual instructors. Thus we emphasized that
this venture was only an experiment, that we had no predetermined results in mind,
and that the essay would have no effect on course grades unless the teacher so
chose.

A committee of three who were not teaching EN 101 that semester was selected
to work out a set of grading criteria to be presented to the department and to
choose the essay to which the students would respond. (We had unanimously
decided the examination would take the form of an expository essay to be based on
a short reading.) This Committee developed specific considerations for grading
papers and, after the usual expressions of individual dissent, the following
criteria were agreed upon, in rank order:

Content
Organization
Paragraph development
Sentence structure
Logic
Usage
Agreement and reference
Point of view
Transitional devices
Punctuation
Spelling

It is interesting to compare these criteria with those revealed in a study
by the Educational Testing Service. In that project, fifty-three distinguished
readers, including ten college English teachers, nine college social science
teachers, eight college natural science teachers, ten writers and editors, nine
lawyers and seven business executives, graded three hundred freshman themes. The
scale they developed ranged, in rank order, from ideas (like our content, first on
the list), to mechanics (usage, punctuation and spelling, which we placed towards
the bottom of our priorities), organization and analysis (somewhat higher on our
scale), phrasing (which, oddly, is not really covered in our criteria), and
"flavor" (style, individuality, interest, sincerity: characteristics which we
felt unmeasurable, but which obviously become determining factors in
distinguishing between an "A" and a "B" paper).1



1 Paul Diederich, John W. French, and Sydell T. Carlton, Factors in
Judgments of Writing Ability (Princeton: ETS, 1962).



Then the committee looked through many essays, primarily from the Op Ed
pages of the New York Times because these seemed timely, provocative and "properly
sized." A satirical piece by Russell Baker entitled "School vs. Education" was
chosen. The selection was, of course, kept secret, but during the semester we
distributed essays of a similar type for students to discuss and write about.

We arranged with the registrar of the college to schedule all the 101 exams
at the same time so that no student would have an unfair edge. Instructors were
asked to do their grading (from A through F) that evening, making no marks on the
students' papers, but recording the results on a roster sheet. When they turned
in their papers to the Department Secretary the next morning, they were to pick
up a "strange" set to grade. All English teachers, full and part-time, who were
not teaching a section of 101 that semester, were also asked to mark at least
one set of papers. Thus everybody in the department participated in the project.

Every student's paper, then, was seen by two teachers, his home instructor
and a disinterested marker; two grades were recorded side by side on individual
roster sheets along with the course grade and a notation of whether the teacher
had averaged the exit essay into that semester grade in any way. Thus we
developed a bank of information from which we hoped, with the help of our campus
computer center, to draw information, primarily on grading consistency.

It took a while to gather the information, computerize the results, and
examine their implications. In fact, while we were waiting, the Committee
requested one other cooperative effort from the department. In September 1976,
each full-time instructor was asked to grade again ten papers that he had done
last January to test the element of self-consistency.

Naturally, we were interested in the findings of the computer. We realized
that statistics, like the bed of Procrustes, can be adjusted to accommodate
whatever degree of whopper we are attempting to project. Just luckily, in working
out our Exit Essay project, however, we were not trying to validate a pre-determined
notion. Rather, we were simply exploring an idea. Whatever news the computer
chose to deliver, we were willing to accept. And what it finally delivered was,
of course, perfect floods of data which, in summary, gave us the basis for future
discussions and decisions.

In all, 1569 students took the test. Of these,

96½% were passed by their own instructors;
94½% were passed by disinterested markers.

3½% were failed by their own instructors;
5½% were failed by disinterested markers.

[figure illegible] received A's and B's from their own instructors;
29% received A's and B's from disinterested markers.

46% received C's and D's from their own instructors;
53% received C's and D's from disinterested markers.

Because of the size of the sample, these figures reveal a predicted level of
statistical significance. Other researchers, notably Richard Braddock,2 have
discovered a similar lack of correlation among readers of the same composition.

Also predictable was the discovered tendency on the part of the home instructor 
to grade higher than the disinterested marker who, naturally, had no personal in- 
terest in or knowledge of the student whose paper he was grading. 

Still, for our purpose which, primarily, was to find out whether there was
consistency in composition grading in our department, the experiment answered well.
If almost 95% of students who were marked by disinterested markers passed the test,
nearly a third with A's and B's, students were learning, and the disparity between
the instructor's marks and the anonymous grader's marks was, on the whole, slight.

On the subject of disparity in grading amongst department members, we
discovered that the disinterested marker graded one grade lower than the home
instructor on 27% of the papers and two grades lower on 9% of the papers, whereas
the home instructor graded one grade lower than the disinterested marker on 17%
of the papers and two grades lower on [figure illegible]. The disinterested marker
failed 5% of papers not failed by the home teachers, whereas the home teacher
failed 2½% of papers not failed by the disinterested marker.

2 Richard Braddock, Richard Lloyd-Jones, and Lowell Schoer, Research in
Written Composition (Champaign, Ill.: NCTE, 1963).

We felt that these differences were not really significant, allowing for, as
they seemed to, the subjectivity of individual instructors and the lack of a
personal factor in the grading by the disinterested markers.
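
The comparison of the two markers comes down to counting, paper by paper, how many letter grades separate the home instructor from the disinterested marker. The following is only a minimal sketch of that tabulation in Python, assuming a plain A-through-F scale with no plus or minus grades. The function name `grade_gap_table` and the ten-paper roster are hypothetical illustrations; they are not the department's data or the computer center's program.

```python
from collections import Counter

# Letter grades mapped to points so that differences can be counted in whole grades.
# Assumption: the scale is A, B, C, D, F with no plus or minus grades.
POINTS = {"A": 4, "B": 3, "C": 2, "D": 1, "F": 0}


def grade_gap_table(pairs):
    """Return the percentage of papers at each (home - disinterested) grade gap.

    `pairs` is a list of (home_grade, disinterested_grade) letters.
    A positive gap means the home instructor graded higher.
    """
    gaps = Counter(POINTS[home] - POINTS[other] for home, other in pairs)
    total = len(pairs)
    return {gap: 100.0 * count / total for gap, count in sorted(gaps.items())}


# Hypothetical roster of ten papers, for illustration only.
sample = [("B", "B"), ("A", "B"), ("C", "C"), ("B", "C"), ("C", "B"),
          ("D", "F"), ("B", "B"), ("A", "C"), ("C", "C"), ("B", "B")]
table = grade_gap_table(sample)
for gap, pct in table.items():
    print(f"gap {gap:+d}: {pct:.0f}% of papers")
```

Keeping the sign of the gap, rather than its absolute value, is what lets the table show the tendency of the home instructor to grade higher.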

Now, how did the grades the students received for the course compare with the
grades they received in the Exit Essay?

95½% of students who passed the Exit Essay as graded by their home teachers
were also passed for the semester. 94½% of students who passed the Exit Essay
as graded by the disinterested marker also passed the course, a difference of 1%.
Thus, it appeared to us that not only was there a general consistency in grading
throughout the department, but that the instrument chosen was a fair measure of
the diverse approaches we used in Freshman English 101.

A further measure offered some insight into our own performances as graders.
As previously mentioned, in September of this year, 240 Exit Essays, written the
previous January, were distributed to twenty-four teachers, ten papers each.
Each of the participating teachers had previously graded the same ten papers, and
was asked to re-grade them without being told what marks he had given them in
January.

Of the 240 papers, 125 were graded exactly as before.

55 were marked one or two grades higher the second time around.

57 were marked one or two grades lower the second time around.

Of the 9 papers which failed on the first grading, 8 of them failed again
on the second round.
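
The counts above are easier to weigh as proportions of the 240 papers. The short Python sketch below does only that arithmetic on the figures as printed; the helper name `pct` is introduced here for illustration. Note that the three categories as printed sum to 237, so three papers are not accounted for in the figures as reproduced.

```python
# Counts as reported for the 240 re-graded essays.
TOTAL = 240
same, higher, lower = 125, 55, 57

accounted = same + higher + lower   # papers covered by the three categories
unaccounted = TOTAL - accounted     # papers the printed figures leave out


def pct(count, total=TOTAL):
    """Return count as a percentage of total."""
    return 100.0 * count / total


print(f"graded the same:  {pct(same):.1f}%")
print(f"graded higher:    {pct(higher):.1f}%")
print(f"graded lower:     {pct(lower):.1f}%")
print(f"not accounted for: {unaccounted} papers")
print(f"failed both times: {pct(8, 9):.1f}% of the 9 first-round failures")
```

On these figures, a little over half of the papers received the identical grade eight months later, and the shifts upward and downward are nearly balanced.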



Here again, a t-test, which was not done, might have shown a certain
significant difference between the grading standards used by individual teachers on
separate occasions. But, certainly, the disparity did not seem to indicate
undue capriciousness or whimsy on the part of the teachers. In fact, the
very closeness of the first and second gradings rather points to a kind of
built-in consistency among individual staff members.
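
For readers curious what the t-test mentioned above would involve, here is a minimal sketch of a paired t statistic in Python, using only the standard library. It rests on assumptions not made in the study: letter grades are treated as evenly spaced points (A = 4 through F = 0), and the eight January and September grades are invented for illustration. The function name `paired_t` is likewise hypothetical.

```python
import math
import statistics

# Assumption: letter grades are treated as evenly spaced points.
POINTS = {"A": 4, "B": 3, "C": 2, "D": 1, "F": 0}


def paired_t(first, second):
    """Return (t, degrees_of_freedom) for paired letter grades.

    The statistic is the mean of the per-paper differences divided by
    its standard error: t = mean(d) / (stdev(d) / sqrt(n)).
    """
    diffs = [POINTS[b] - POINTS[a] for a, b in zip(first, second)]
    n = len(diffs)
    if n < 2:
        raise ValueError("need at least two paired grades")
    sd = statistics.stdev(diffs)  # sample standard deviation (n - 1 divisor)
    if sd == 0:
        raise ValueError("all differences are identical; t is undefined")
    t = statistics.mean(diffs) / (sd / math.sqrt(n))
    return t, n - 1


# Invented grades for eight papers marked twice by the same teacher.
january = ["B", "C", "A", "C", "B", "D", "C", "B"]
september = ["A", "C", "A", "B", "C", "D", "B", "B"]
t, df = paired_t(january, september)
print(f"t = {t:.2f}, df = {df}")
```

For this invented sample t comes to about 1.0 with 7 degrees of freedom, well short of the two-tailed 5% critical value of roughly 2.365, so no systematic shift would be claimed. Because letter grades are ordinal, a sign test or a signed-rank test might be the more defensible choice.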

Most of the department members feel that the Exit Essay experiment was
worthwhile. Such studies, of course, have been done before, with somewhat
similar findings, and we could have simply absorbed these. The proof,
however, often lies in the doing. In the cooperative struggle, we learned
something about ourselves, our values, our attitudes towards our students,
our points of agreement and disagreement. We were pleased to discover that
all of us, working in our different ways, are moving in the same general
direction, using similar standards, attempting related goals. Despite the
good feelings, the notion of an Exit Essay as standard end-of-the-semester
procedure, however, was greeted with reluctance by the department.

Some members feel that making the Exit Essay a permanent, mandatory part
of the Freshman English curriculum will lead to standardization of the course,
to lock-step teaching for the exam, to an invasion of professionalism, and to
an intrusion of privacy. Others, more practical, see the problem as one of
difficulty in choosing a suitable essay. Russell Baker's essay was considered
by several teachers to have been a poor selection. Some thought that Baker's
satire took unfair advantage of the students' lack of sophistication; others
that the question called for too much recapping, a writing device which
composition teachers attempt, often vainly, to train out of their students.
Yet, even if a common Exit Essay is never again undertaken at Rockland
Community College, we have had a consciousness-raising experience and developed
an awareness of what department standards are, whether we are following them,
and how our individual grading policies compare with those of our colleagues.

This experiment was deliberately limited in scope and made no attempt to
tackle far more important questions: are our students learning to think and
write during the year of freshman composition? how different are their ideas
and expression when they leave from when they came? has their humanity, in
some way, been touched by their stay with us?

Our attempts to move towards uniformity in grading standards, temporarily
completed, have, perhaps, cleared the way for this other two-year project!



